Section: New Software and Platforms
KATS
Kaldi-based Automatic Transcription System
Keyword: Speech recognition
Functional Description
KATS is a multipass system for transcribing audio data, and in particular radio or TV shows. The audio stream is first split into homogeneous segments that are decoded using the most adequate acoustic model with a large vocabulary continuous speech recognition engine. In this new software, the recognition engine is based on the Kaldi toolkit, and uses Deep Neural Network - DNN - based acoustic models. An extra processing pass is run in order to rescore the -best hypotheses with a higher order language model.
-
URL: Available online on the A||go platform: https://allgo.inria.fr/app/loriasts_kaldi